Unified Named Entity Recognition as Word-Word Relation Classification

نویسندگان

چکیده

So far, named entity recognition (NER) has been involved with three major types, including flat, overlapped (aka. nested), and discontinuous NER, which have mostly studied individually. Recently, a growing interest built for unified tackling the above jobs concurrently one single model. Current best-performing methods mainly include span-based sequence-to-sequence models, where unfortunately former merely focus on boundary identification latter may suffer from exposure bias. In this work, we present novel alternative by modeling NER as word-word relation classification, namely W^2NER. The architecture resolves kernel bottleneck of effectively neighboring relations between words Next-Neighboring-Word (NNW) Tail-Head-Word-* (THW-*) relations. Based W^2NER scheme develop neural framework, in is modeled 2D grid word pairs. We then propose multi-granularity convolutions better refining representations. Finally, co-predictor used to sufficiently reason perform extensive experiments 14 widely-used benchmark datasets overlapped, (8 English 6 Chinese datasets), our model beats all current top-performing baselines, pushing state-of-the-art performances NER.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Effective Word Representation for Named Entity Recognition

Recently, various machine learning models have been built using word-level embeddings and have achieved substantial improvement in NER prediction accuracy. Most NER models only take words as input and ignore character-level information. In this paper, we propose an effective word representation that efficiently includes both the word-level and character-level information by averaging its charac...

متن کامل

Transliterated Named Entity Recognition Based on Chinese Word Sketch

One of the unique challenges to Chinese Language Processing is cross-strait named entity recognition. Due to the adoption of different transliteration strategies, foreign name transliterations can vary greatly between PRC and Taiwan. This situation poses a serious problem for NLP tasks: including data mining, translation and information retrieval. In this paper, we introduce a novel approach to...

متن کامل

Deep learning with word embeddings improves biomedical named entity recognition

Motivation Text mining has become an important tool for biomedical research. The most fundamental text-mining task is the recognition of biomedical named entities (NER), such as genes, chemicals and diseases. Current NER methods rely on pre-defined features which try to capture the specific surface properties of entity types, properties of the typical local context, background knowledge, and li...

متن کامل

Semi-supervised Bio-named Entity Recognition with Word-Codebook Learning

We describe a novel semi-supervised method called WordCodebook Learning (WCL), and apply it to the task of bionamed entity recognition (bioNER). Typical bioNER systems can be seen as tasks of assigning labels to words in bioliterature text. To improve supervised tagging, WCL learns a class of word-level feature embeddings to capture word semantic meanings or word label patterns from a large unl...

متن کامل

Chinese Word Segmentation and Named Entity Recognition by Character Tagging

This paper describes our word segmentation system and named entity recognition (NER) system for participating in the third SIGHAN Bakeoff. Both of them are based on character tagging, but use different tag sets and different features. Evaluation results show that our word segmentation system achieved 93.3% and 94.7% F-score in UPUC and MSRA open tests, and our NER system got 70.84% and 81.32% F...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence

سال: 2022

ISSN: ['2159-5399', '2374-3468']

DOI: https://doi.org/10.1609/aaai.v36i10.21344